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The Origins of the Standards Movement in the United States: Adop- 
tion of the Written Test and its Influence on Class Work 


MIYAMOTO, Kenichiro* 


Every state in the United States, under the NCLB act, has set state stan- 
dards and is testing all students in grades 3-8. Students are given printed ques- 
tions to which they write answers with a pencil on an answer sheet. These 
written tests are usually given to determine the academic achievements of stu- 
dents. 

This paper traces the early history of the written examination and the 
change in the meaning of “standards” from the middle of the 19th century to 
the early 20th century. Although the meaning of standards was ambiguous a 
cen tury ago, by focusing on the written examination for promotion, we can in- 
vestigate the meaning of standards at differen t poin ts in time, i.e. what educa- 
tors expected pupils to achieve at primary school. 

It has been stated that (1) the written examination for promotion has 
forced teachers to accept external standards given by educational administra- 
tors; (2) it has also widened the distance between examinations and regular 
class work, and (3) “standard” had approximately the same meaning as “ norm ” 
in the educational measurement movement in the early 20th century. 

However, we need to examine the works of educators such as Emerson E. 
White and Joseph Baldwin, who, in the late 19th and early 20th century, were 
opposed to written examinations for promotion, and tried connecting written 
examinations and regular class work. Frank McMurry insisted at the time that 
standards should be the means for improving the curriculum and instruction at 
schools. 

Educational measurement did not continue for a long time afterward, and 
was criticized by many educators, especially progressives. The standards move- 
ment arose in the 1980s, in the midst of heated controversy surrounding exam- 
based testing that has lasted from the middle of the 20th century. Yet supporters 
of the standards movement are still struggling to solve the same problems that 
White, Baldwin, and McMurry > tackled a century ago, most notably, how best 
to connect instruction and examination. 

The answer to this problem lies in unification of the two. Educators now 
possess various methods and media for connecting examination and instruc- 
tion. However, the early history of the standards movement implies that educa- 
tors do not necessarily re-establish interaction and co-operation in class 
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between themselves and pupils as long as academic achievements of children 
are compared with standards external to teachers. Instruction does not occur 
without participation of teachers in setting standards and aims of instruction. 


1 Introduction 

A national survey of academic achievement was conducted in April, 2007 after 46 years’ 
interval in Japan. It was also done in April, 2008, and is supposed to be conducted annually for 
all pupils of 6 th grade and 9 th grade. The revival was caused by public anxiety concerning the de- 
cline in academic standards of Japanese children. They were said to have lost high rank in inter- 
national comparisons of academic performance as shown by their scores on PISA in 2000, 2003 
and 2006.' The Japanese Ministry of Education, Culture, Sports, Science and Technology (MEXT) 
revised the national course of study in March, 2008 and is encouraging every school to make ef- 
forts to raise academic standards and performance. 2 

The situation is similar to that of the United States since the 1980s. The standards move- 
ment, sometimes called “standard-based reform,” has been ongoing in the United States. 3 The fed- 
eral government recently began drawing up policy regarding setting and raising standards. 4 After 
the publication of A Nation at Risk (1983), 5 there appeared a great deal of literature accusing pub- 
lic schools of academic laxity, and calling for a more rigorous academic curriculum and higher 
academic standards in public schools. 6 There have been three influential documents: AMERICA 
2000: An Education Strategy announced by the 41 st president, George Bush, in 1991; Goals 2000 
enacted by the Clinton administration in 1994; and the No Child Left Behind Act of 2001, the leg- 
islation reauthorizing the Elementary and Secondary Education Act by the George W. Bush admin- 
istration. 

These three documents contain the common assumptions that higher academic standards 
are necessary for the nation, and that achievement tests should be used to ascertain the level of 
academic performance of students and schools. Though the federal government cannot intervene 
directly in educational matters in individual states, it can have an enormous influence on state ed- 
ucation policy by requiring states to implement statewide accountability systems. Under the NCLB 
Act, each state has set state standards, and must annually test all children in grades 3-8 in reading 
and mathematics, based on these standards. Not a few educators are trying to set national standards. 
These standards are not within the jurisdiction of teachers in that they are decreed to teachers from 
the top down, and educational achievements are tested and connected with rewards and sanc- 
tions. 7 

Although the federal government had paid scant attention to educational attainment before 
the legislation of the Elementary and Secondary Education Act of 1965, American educators have 
been concerned with the educational standards of public schools since the common school system 
was established in the 19 th century. Teachers have been asking what students should learn in the 
schoolroom or in the classroom since early times. Educational administrators, who appeared in the 
late 19 th century, have been asking what teachers as well as students should know. In the 20 th cen- 
tury, all educators interested in the standards movement were asking how students’ and teachers’ 
achievements should be measured, evaluated, and assessed. These are momentous questions for 
most advocates of the recent standards movement in the United States. 
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Recent research points out some perils of the standards movement. Thomas Sobol argues 
that setting higher standards does not necessarily improve educational outcomes without support 
for teachers and necessary resources. 8 Linda Darling-Hammond emphasizes that higher standards 
undermine access to education for low-achieving students rather than enhancing it. 9 Kennon M. 
Sheldon and Bruce J. Biddle maintain that it entails high-stakes tests, coupled with rewards and 
sanctions, which deprive children of intrinsic interest in subject matter. 10 Patricia Graham points 
out in her historical studies that teaching standards has become the same as “teaching to the test.” 11 
Scott Thompson pays attention to the fact that standards-based reforms are sometimes confused 
with test -based-reforms. 12 

Most research assumes that students’ academic achievements can be ascertained by written 
tests. When we try to assess academic achievements, we usually use paper-and-pencil tests, or writ- 
ten examinations, 13 as is shown in the examinations used in the Programme for International Stu- 
dent Assessment (PISA), National Tests in England, National Achievement Tests in Japan, and the 
National Assessment of Educational Progress (NAEP) in the U.S.A. Although in recent years oth- 
er types of examination have appeared, for example, performance-based assessments, 14 written ex- 
aminations are dominant in most cases. We must be conscious that the written test measures only 
one aspect of academic achievement, nevertheless it constitutes the most important part in high- 
stakes testing. We need to know the effects of using written examinations as a way of assessing 
students’ achievement. 

This paper traces the early history of the written examination and the change of the mean- 
ing of “standards” from the middle of the 19 lh century to the early 20 th century. It will show the 
emergence of external standards given to teachers, and teachers’ responses to external standards. 
By focusing on the written examination for promotion, it is possible to examine the meaning of 
the standards, i.e. what educators expect pupils to achieve at primary school. 


2 Adoption of the Written Examination 

1. Boston School Survey in 1845 

In the 19 th century, there were two kinds of examinations, oral and written. Oral examina- 
tions included recitations, vocal expression, question and answer, or disputation; written examina- 
tions were essay-type, problem-type, and some other objective achievement tests. 15 The latter one 
has spread rapidly since the late 19 lh century. 

Oral examinations were the most prevalent method of ascertaining student achievement be- 
fore the steel pen and the pencil became commonly used in the late 19 lh century. 16 Teachers tested 
students’ memory in a recitation to find out whether or not they had achieved the required standard 
and proficiency. Oral examinations were not impartial or objective in measuring achievement, but 
teachers could judge ability, based on their observation of pupils in their daily work. Teachers 
“knew no qualms in those days about reliabilities or validities or comparability.” It was the “Age 
of Innocence.” 17 

Instead of oral examinations, written examinations, though not popular in the middle of the 
19 th century, came into common use in most schools by the end of the 19 th century. Horace Mann 
(1796-1859), State Superintendent of the Public Instruction of Massachusetts, was the first admin- 
istrator who emphasized written examinations. In 1845, the Boston School Committee conducted 
a survey on the achievements of all students in Writing and Grammar Schools in order to prove 
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the effectiveness of public education and to show that Boston’s schools required no state supervi- 
sion. Oral examinations were usually used for examination at that time, but it was difficult to give 
oral examinations to over 7,000 students. “Written examinations were given, and scored under uni- 
form conditions, and the tabulated results published in detail, question by question and school by 
school.” 18 With all the ineffectiveness revealed by the survey, Boston School Committee published 
every figure. 19 It might be “the first survey of an American system.” 20 

Horace Mann became interested more in the way of inspection than in the results. He crit- 
icized oral examinations and emphasized the merits of written examinations “as a potential useful 
supervisory tool for his office.” 21 He maintained, 

When the oral method is adopted, none but those personally present at the examination can 
have any accurate or valuable idea of appearance of the school... Not so, however, when 
the examination is by printed questions and written answers. A transcript, a sort of Da- 
guerreotype likeness, as it were, of the state and condition of the pupils’ minds, is taken 
and carried away, for general inspection. Instead of being confined to committees and vis- 
itors, it is open to all; instead of perishing with the fleeting breath that gave it life, it re- 
mains a permanent record. All who are, or who may afterwards become interested in it, 
may see it. 22 

Caldwell and Courtis summarized his arguments on the superiority of written examinations 
over oral examinations as follows: 

1 . It is impartial. 

2. It is just to the pupils. 

3. It is more thorough than older forms of examination. 

4. It prevents the “officious interference” of the teacher. 

5. It “determines, beyond appeal or gainsaying, whether the pupils have been faithfully and 
completely taught.” 

6. It takes away “all possibility of favoritism.” 

7. It makes the information obtained available to all. 

8. It enables all to appraise the ease or difficulty of the questions. 23 

The superintendent was seeking impartiality in measuring students’ attainments and effi- 
ciency in administering examinations. He was anxious that “the different committees (would) have 
different standards of excellence,” and that “different standards of judging (would be) in the minds 
of different men.” 24 Written examinations were almost wholly responsible for stopping teachers 
from judging pupils’ achievements to be arbitrary and bringing about administrative standards. The 
Boston School Survey was the beginning of objective measurement. It had suggested the possibil- 
ity that standards would become external to teachers. Answer sheets could be read by someone 
other than teachers, and filed outside of the classroom. 

Written examinations did not become widely popular at that time. However, fifty years lat- 
er, Joseph M. Rice, Edward L. Thorndike, and some other educationists, who advocated science 
of education, promoted written examinations, changed the way of scoring, and used them for mea- 
suring students’ academic achievements. 25 
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2. Development of Writing and Scoring Tools 

Written examinations were not popular until students began using the steel pen instead of 
the quill pen. The first factory for the mass production of the steel pen was built in New Jersey in 
1870. By 1890, students had become used to using the steel pen and to writing answers on paper 
in written examinations. 

Mass production of pencils began in 1866 when Joseph Dixon was granted a patent for his 
machine. Demand for pencils grew, and more than 20 million pencils were being sold annually by 
the 1870s. Prices of pencils declined and quality improved in the late 19 lh century. By the begin- 
ning of the 1900s, the pencil was commonly used in the classroom. Writing on paper with a pen- 
cil was much easier than scratching on slate surface with a special slate pencil. Students could 
write rapidly and continue writing for a long time in class. 26 

At the turn of the century, when such writing tools became popular in the classroom and 
used in examinations, the character of the written examination changed. There existed several types 
of achievement tests in the 1930s. In addition to oral examinations, they were essay-type exami- 
nations, problem-type examinations, and objective achievement tests (new-type). 27 Though ninety 
percent of actual examinations were still essay-type, the use of the new-type objective achievement 
tests was rapidly growing. 28 It would not have been possible without the new writing tools. The 
invention of an electronic-scoring machine by the International Business Machines Corp. in 1936 
made new- type examinations much easier to use in many schools. 29 

We must keep in mind that the development of these technical devices brought about writ- 
ten tests. 30 They made it possible for a large number of children to take the same examination at 
the same time. External examiners, who were not necessarily teachers, could collect students’ an- 
swer sheets and score them objectively and mechanically. Examiners were thus prohibited from 
changing marks due to favoritism for certain students. 


3 Examinations for Promotion 

While grading and promotion gradually spread in city school systems in the 1860s, 31 ex- 
aminations began to have a new function. They became important means to classify children ac- 
cording to a certain standard of achievement, or “grade.” Students who did not reach the standard 
were not promoted, and were sometimes demoted. High-stakes testing, as it is called today, came 
into being. By the end of the 19 th century, examinations for promotion were commonly in use in 
elementary and secondary schools. Both students and teachers could not but be conscious of what 
students should know and could do to pass the examination for promotion. 

According to The Cyclopedia of Education published in 1877, graded schools were usually 
defined as “schools in which the pupils are classified according to their progress in scholarship as 
compared with a course of study divided into grades, pupils of the same or a similar degree of 
proficiency being placed in the same class,” and “promotions should always be based upon a care- 
ful examination ; and, in a graded school, care should be taken that every grade is passed through 
in a legitimate manner, that is, without hurry or cramming.” 32 (Underline added by the author). At 
that time, “grade” meant a child’s degree of academic proficiency, and “promotion” was decided 
by an examination. We can think of “grade” as having almost the same meaning as “academic 
standard,” the topic which we are now discussing. Though examinations were not confined to writ- 
ten tests, they were more likely to be used as examinations for promotion. 
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Until the 1 860s, teachers conducted examinations during regular class work. There existed 
neither examinations for promotion nor grade standards, because schools were ungraded. In the 
late 19 th century, however, as schools were graded, the examination for promotion was introduced 
and given at the end of each grade. The promotion standard was bom with the invention of grad- 
ing. 

Several educators were opposed to examination for promotion. William J. Shearer, Super- 
intendent of Schools in Elizabeth, said, “The teaching test is a necessary part of all true teaching, 
but the promotion examination prevents broad and intelligent teaching, makes out of the teacher a 
grind [.svc] , and turns out machine pupils. It is not a good test either of the ability of pupils or of 
teachers. It is a great temptation to deceit and causes many mental wrecks.” 33 He emphasized that 
teachers should observe students’ daily work carefully, and that the teacher’s evaluation was a mat- 
ter of utmost importance for promotion. 

The examination for promotion had less relation with daily class work than previous ex- 
ams. It was quite different in character from that conducted by teachers during daily class work. 
J. L. Pickard (1824-1914), Superintendent of Public Schools of Chicago, emphasized the distinc- 
tion between the roles of the superintendent and the teacher. 

A superintendent’s examination should be directed rather to the judgment than to the mem- 
ory of the child. Questions can be so framed as to call for opinions rather than facts. The 
teacher’s examination calls for facts more than for opinions. . . . The frequent examination 
by teacher will test memory — the rare examination by the superintendent will measure men- 
tal grasp. ... The teacher’s examination reviews a small portion of text-book work. The 
superintendent’s examination covers a wider field, and fastens upon the child’s mind the 
connection between its parts. ... In arithmetic, the teacher asks how, the superintendent 
rather why — one calling for solution of examples, the other for statement of principles: the 
teacher confines himself to illustrative examples; the superintendent more to principles il- 
lustrated. 34 

Teachers and superintendents assumed different roles in examining children’s academic 
abilities. The teacher’s role was to test children’s memory and give them techniques, while the su- 
perintendent’s role was to develop their minds. 

The principal or the superintendent generally made the decision on a pupil’s promotion. 
Pickard pointed out the evils of divergent judgments between principals, and proposed the “unifi- 
cation of work of different principals.” He continued, 

Much may be done toward this unification of effort by means of the superintendent’s fre- 
quent conferences with the principals. But after all the theorizing before an association, he 
can only determine the practical results by such an inspection of their work as a common 
examination will afford — an examination of such a character as shall present, in questions 
prepared, the outcome of his theorizing. A single test may suffice for several years if it may 
be made to cover the entire ground traversed by the classes presented for promotion. This 
test will be of incalculable advantage as a means of instruction to teachers. 35 

The superintendent chose promotion standards, and teachers were supposed to accept minor 
roles in deciding whether or not a child would be promoted. The standards given by the superin- 


The Origins of the Standards Movement in the United States 


33 


tendent were external to teachers so far as they did not participate in setting them. 

Joseph M. Rice (1857-1934), famous as a popularizer of educational measurement, recog- 
nized the significance of teachers’ roles, but his words implied distrust of teachers. He said, “A 
school system must be judged not by what particularly energetic teachers are, of their own accord, 
willing to do, but what each teacher is required to do in order that she may retain her position.” 36 
Bureaucracy was appearing in the school system. 

In 1912, David Snedden (1868-1951) found the general tendency that “A highly mechani- 
cal system tends to introduce external examinations as a basis for promotion and graduation.” As 
examples of external examination, he referred to the “payment by results” plans in England and in 
American cities during the period from 1870 to 1895. 37 


4 Criticisms of the Written Examination for Promotion 

1. Teaching Examination: E. E. White (1829-1902) 

When the written examination for promotion came into common use in the 1890s, many 
educators, mostly superintendents, began to criticize its defects severely at annual conferences of 
the National Education Association and at other conferences. Some of them conducted a survey on 
examinations, or wrote books and articles on school management to exemplify their arguments. 
They pointed out that the written examination was measuring only verbal accuracy; it would tempt 
pupils to dishonesty; the severe tax of using a pen or pencil on students could lead to physical in- 
jury; teacher would drill facts rather than foster ideas. 38 

Being conscious of these defects, Emerson E. White, Superintendent of Cincinnati, Ohio, 
regarded the distance between the matters dealt with in pupils’ daily lessons and examinations for 
promotion as a serious problem. He made a survey on the situation of the promotion system in 
leading cities in the U.S. in around 1890. He found that “the use of written examinations as a ba- 
sis of promotion (had been) well-nigh universal in graded schools.” 39 There were two factors which 
determined pupils’ promotion: examination results and teachers’ judgment. He gave attention to the 
fact that “the reported union of these factors (was), in some cases, only nominal, most pupils be- 
ing promoted on examination results or class record, the other factor being considered only in 
doubtful cases.” 40 White was not dissatisfied with dismissal of teachers’ judgment, but he was anx- 
ious about the distance between daily class work and the promotion examinations. 

He was opposed to promotion examinations and proposed promotion without examinations. 
He thought that daily evaluations recorded by teachers should be the basis of promotion, and 
said: 

The special purpose of these recorded estimates is to pass judgment on the fidelity and suc- 
cess of pupils during the period for which they are made. They are not primarily judgments 
respecting the fitness of pupils for promotion, but they afford an intelligent basis for such 
a judgment when the time for promotion comes. 41 

For him, “a teacher who does not know the fidelity and success of his pupils in daily work 
is not competent to teach, and should be retired.” 42 

While he criticized the promotion examination, he emphasized the significance of the teach- 
ing examination: 
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Whatever of testing may be needed to determine the proficiency of individual pupils should 
be done mainly by the teacher. If he is competent to teach, he is also competent to test the 
results, and these he must personally know in order to teach.... He must know to what ex- 
tent his instruction has been appropriated by each pupil in his class . 43 

Superintendent White considered the teaching examination “as essential a part of teaching 
as instruction itself .” 44 Teachers must set grade standards and make teaching examinations for 
themselves. Teachers must read essays written by pupils. He would not put the management of ex- 
aminations into the hands of educational administrators. He further stated that, “When the exami- 
nation of pupils becomes an outside business, the high office of teacher may sink into the trade of 
preparing pupils for examination .” 45 

2. Written Recitation: Joseph Baldwin (1827-1899) 

Joseph Baldwin, professor of pedagogy at the University of Texas, was also critical of the 
promotion examination, and tried to abolish “the periodic examination in our schools as the test of 
scholarship and as the standard for promotion.” He substituted the written recitation for “the dread- 
ed test examination ,” 46 namely, the promotion examination. 

The written recitation consisted of oral questions asked by a teacher and answers to be writ- 
ten by students during regular class work. This would occur whenever the teacher deemed it help- 
ful. Pupils did not need to prepare for the recitation, but were always ready for written work on 
their tablets. The teacher read the papers submitted by the pupils at his leisure. The papers were 
returned to the pupils with the mark S (satisfactory), or U (unsatisfactory). The written recitation 
was not so much a test examination as regular work in class. 

The written examination for promotion had no place in Baldwin’s ideology. He argued that 
promotion should follow satisfactory work, saying: 

It (promotion) does not depend on per-cent marks or on test examinations. Pupils study to 
know. They become interested in the work and advance with their classes. As the days go 
by, pupils found able to work more profitably with higher classes are promoted, and pupils 
who prove unable to work with their classes are demoted. As the terms go by, pupils who 
do satisfactory work go forward with the class. The teacher knows. There can be no pos- 
sible excuse for per-cent marking or for test examinations as conditions for promotion or 
graduation; such devices occasion a world of trouble and do incalculable injury. Good man- 
agement and good teaching remedy these evils . 47 

Baldwin was especially critical of per-cent marking, as his words above show. It included 
prizes, honours, test examinations, per-cent record, and per-cent reports. He thought that per-cent 
marking was a hurtful device, and adamantly insisted that it should be abandoned. 

It (per-cent marking) is a low incentive. It magnifies success at the expense of fidelity. It 
fosters a brood of school vipers, such as honours, prizes, and hurtful emulations. It abounds 
in the exact ratio of poor teaching; the poorer the teaching the more perfect the marking. 
Too often per-cent marking proves an antidote to high thinking and moral teaching . 48 


Baldwin did not dispel the examination or the marking. It is true that the “test examination” 
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and “per-cent marking” were objects of his hatred, but he was more interested in good teaching. 
Per-cent marking, which was often thought to be an incentive to study, would only encourage stu- 
dents to aim at irrelevant targets. He believed that good teaching would be sufficient to give chil- 
dren incentives to study. 

3. Regular Class Work and Promotion Examinations 

White and Baldwin had two propositions in common. One was that the written examina- 
tion could be a means for improving regular instruction. They had strong interest in improving 
regular class work, and promotion could not be the aim of instruction. Though White and Baldwin 
criticized the written test for promotion, they did not deny the significance of the written test used 
in class. On the contrary, they emphasized the effectiveness of the teaching examination and writ- 
ten recitation in instruction. 

The other proposition they had in common was that teachers could more or less decide stu- 
dents’ grades and eligibility for promotion. They insisted that promotion should be based on daily 
class work. Naturally, teachers’ judgments were the most important factor in the decision. Promo- 
tion standards could not be outside the jurisdiction of teachers. 

David Snedden in 1912, referring to the situation of grading and promotion after 1895, ob- 
served, “Teachers’ judgment of the pupil’s ability to proceed enters as a factor, as do also formal 
records made of a term’s work.” 49 These words are one piece of evidence to suggest that teachers’ 
judgement and regular class work were important factors for in deciding promotion in the early 
20th century. Whether his diagnosis on grading and promotion reflected the reality of the times is 
uncertain, but it predicted the coming dispute on measurement and standardized testing. 


5 The Standards for Efficiency: Educational Measurement 

1. Standard as norm 

In 1912, Stephen S. Colvin (1869-1923) divided teachers in elementary and secondary 
schools into two classes, those who liked to mark and those who did not. 50 It was too simplistic a 
classification. Neither White nor Baldwin disliked marking. However, when written examinations 
for promotion proliferated and Edward L. Thorndike and some other educational psychologists de- 
veloped numerous standardized tests in “the age of standard, 1890-1915, ” 51 criticisms of examina- 
tions and standards also had considerable influence on classroom teachers. 

It was inevitable that grading actually caused retardation in public schools. Retardation 
meant at that time the failure to be promoted regularly from grade to grade. The number of retard- 
ed students, sometimes called laggards, became an indicator of the efficiency of school systems. 
The fewer the number of laggards, the more efficient was the school system. L. P. Ayres invented 
an “Index of Efficiency” based on numbers of promotion and retardation, and made a list of cities 
arranged in the order of the Index. 52 

In the late 19 th century, the age of Social Darwinism, 53 societal reform was always directed 
toward efficiency. Superintendents, as professional educational administrators, were eager to make 
the school system more efficient. At that time, it was believed that when the levels of efficiency 
between school systems were compared in a competitive society, the standards for promotion 
should be equal in terms of difficulty. Hence, objective measurement and standardized tests were 
developed. 
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Advocates of standardized tests did not have confidence in teachers’ judgment of students’ 
academic ability. Stephen S. Colvin criticized the marking system of the time for being “unsystem- 
atic,” as pupils were marked “for the convenience of the teacher and principal.” Instead, he ex- 
pected “men of scientific ideals and scientific training” to create the methods of objective 
measurement.” 54 What most interested him was efficiency of the school system. He asserted that, 

The school system must be efficient, and how can we know that it is, unless we have stan- 
dards to measure such efficiency? The importance (of this question) ... is shown by the 
prominent place that the question of securing standards to measure educational practices 
was given at the recent meeting of Superintendents in St. Louis. 55 

His usage of the word “standard” is confusing. He did not discriminate between “standards 
to measure ...efficiency” and “standards to measure educational practices.” When he asked for 
“greater exactness” and “completeness” in measurement, standards were the means for acknowl- 
edging efficiency of the school system, not “for the convenience of teacher and principal.” Stan- 
dards were to be made by “men of scientific ideals and scientific training,” i.e. “educational 
experimentalists.” He did not refer to what pupils should know or what teachers expected pupils 
to achieve. 

George D. Strayer (1876-1962), professor of educational administration at Teachers Col- 
lege, more positively acknowledged the relation between standards in business administration and 
measurement of pupils’ achievements. He wrote in A Cyclopedia of Education edited by Paul Mon- 
roe in 1913, 

With the derivation of scales for measuring the achievement of pupils, and with the devel- 
opment of standards in business administration and in organization, it will be possible to 
test adequately the efficiency of a school or system of schools. When such a system of tests 
and standards has been derived in all fields, there will be no excuse for the objection to ex- 
aminations or tests, which one hears so commonly now, because of their inadequacy or un- 
fairness. It is to be expected too that the efficiency of our schools will be greatly increased 
by the application of such measures. 56 

When we look into the Encyclopedia of Educational Research (revised edition, published 
in 1950), we can find “Standards” in its index list, but we are directed to read the item “norms.” 
“Norm” is a statistical term. By the middle of the 20 th century, the word “standard” and “norm” 
were often used interchangeably in educational measurement. 57 Standards were norms to be refer- 
enced to measure efficiency of school systems. 

2. Standards in Curriculum 

Although neither Colvin nor Strayer paid much attention to standards in the school curric- 
ulum, Frank M. McMurry (1857-1929), professor of elementary education at Teachers College, 
famous as a Herbartian, was extremely interested in curriculum and instruction in relation to stan- 
dards. He wrote Elementary School Standards: Instruction — Course of Study — Supervision (1913) 
as one of the School Efficiency Series, edited by Paul Hanus (1855-1941). McMurry set standards 
in relation to aims of instruction. He defined a good curriculum as follows, 
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The curriculum will be good to the degree in which it contains problems — mental, moral, 
esthetic, and economic — that are socially vital and yet within the appreciation of the pupils; 
and its method of presenting that curriculum will be good to the degree in which it exem- 
plifies the methods of solving problems found most effective by the world’s most intelligent 
workers. 58 

He made no mention of promotion examinations, but he discussed the graded course of 
study in detail. Each grade had positive standards. Promotion occurred naturally after pupils 
achieved grade standards. He thought standards were valuable “as tests of habits”; “as tests of thor- 
oughness of knowledge”; “as tests of instruction in the three R’s”; “as sources of suggestion for 
improvement”; and “as tests of the curriculum.” 59 The merit of standards, he thought, was to fix 
attention “(1) on what the children are doing, and (2) on the value of it as judged by its relation 
to the purposes of instruction ,” 60 (Italics are in the original). 

McMurry thought that standards forjudging instruction should include four factors: motive 
on the part of the pupils; consideration of values by pupils; attention to organization of ideas by 
pupils; and initiative by pupils. He discussed ways of applying standards to course of study and 
supervision, and described them in detail. In addition, he gave some recommendations. 61 He held 
the firm conviction that standards should be the means for improving curriculum and instruction. 

McMurry ’s view on standards was not congruent with the preface attached to his book by 
the series editor, Paul Hanus. Hanus was dissatisfied that McMurry did not apply scientific mea- 
surement in judging the value of instruction. Hanus wrote in the preface, 

Scientific measurement in education is, indeed, as yet too little developed to be applied to 
more than a very limited portion of the work of the elementary schools. Except for arith- 
metic and penmanship, ‘standard scores’ or standard achievements are not available for 
measuring the quality of the results actually attained by the schools; and even for penman- 
ship and arithmetic, the standard measures for each grade are not yet firmly established. 62 

Hanus recognized that “the methods of exact measurement (were) still in process of evolu- 
tion,” and wrote McMurry’s conclusions would be “confirmed by exact measurement, so far as it 
was possible to apply exact measurement, by Stuart A. Courtis.” 63 Hanus did not realize that his 
usage of the word “standard” was quite different from McMurry’s. In McMurry’s argument, the 
standards were not “standard scores,” or “standard achievements,” but were “sources of suggestion 
for improvement (of instruction).” 64 Hanus must have thought of Frank McMurry as a giant of the 
old days, as probably did Colvin and Strayer. It was “the age of standards.” 


6 Conclusion 

We have traced the history of standards and the written examination from the 19 th century 
to the early 20 th century. As has been made apparent, the standards movement did not begin with 
the publication of A Nation at Risk. We can find several antecedents in the 19 th century. 

The written examination, which is now the most dominant means for testing students’ aca- 
demic achievements, was first adopted in the Boston School Survey in 1845. Superintendent Hor- 
ace Mann found the written examination to be impartial, objective, and free from arbitrary judgments 
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of teachers. It was the beginning of objective measurement of academic achievements. External 
standards were ready to materialize. 

The common use of the steel pen, pencil, and paper, which occurred in the late 19 lh cen- 
tury, popularized the written examination. Written examinations were used to test students’ achieve- 
ments in grading and promotion. Hence, external examinations and promotion standards were 
developed. They were made by educational administrators and teachers were expected to accept 
them. 

Some educators were vehemently opposed to written examinations for promotion. What 
they feared most was the distance between instruction and examination. They were interested in 
improving regular class work. They thought examinations for promotion did not contribute to class 
work and actually hurt pupils. Promotion standards were closely connected with their daily class 
work. 

The period between 1890 and 1915 was the age of standards. Educational measurement, in 
the name of science of education, increased rapidly. Many school surveys were conducted. Some 
educational administrators made lists of cities ranked in the order of the “Index of Efficiency.” 
Standards became the means to measure efficiency of education. They had a close relationship to 
achievement as measured by the standardized test, but lost their relation to the aims of instruction. 
Standard had almost the same meaning as “norm.” 

The educational measurement movement did not continue for a long time. It was criticized 
by many educators, especially progressives. There is numerous research concerning the conse- 
quences of the measurement movement. 65 Long after the heated controversy in the middle of the 
20th century, the standards movement once again arose in the 1980s. Yet, supporters of the stan- 
dards movement are still struggling to solve the same problems that White, Baldwin, and McMur- 
ry tackled a century ago, most notably, how best to connect instruction and examination. The 
answer to this question may well lie in unification of the two. 

The experience of educators alive a century ago shows us that the written examination cre- 
ated distance between instruction and examination and made standards external to teachers. We 
must now re-establish the connection that previously existed between them. How can this be pos- 
sible? 

We are now using various media and methods for assessing students’ academic achieve- 
ments. These can be useful means to connect examination and instruction. By using computers, e- 
mail and other writing, reading and communicating tools, we can easily know students’ responses 
in instruction. Performance-based assessment can show students abilities which paper-and-pencil 
tests cannot measure. 

However, the early history of the standards movement implies that these new media and 
methods do not necessarily re-establish personal relationship between the teacher and the pupil, as 
long as academic achievements of children are compared with standards external to teachers. In- 
struction does not occur without participation of teachers in setting standards and aims of instruc- 
tion. 
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